Modeling Speech Melody as Communicative Functions with PENTAtrainer2

نویسندگان

  • Santitham Prom-on
  • Yi Xu
چکیده

This paper presents PENTAtrainer2, a semi-automatic software package written as Praat plug-in integrated with Java programs, and its applications for analysis and synthesis of speech melody as communicative functions. Its core concepts are based on the Parallel Encoding and Target Approximation (PENTA) framework, the quantitative Target Approximation (qTA) model, and the simulated annealing optimization. This integration allows it to globally optimize for underlying pitch targets of specified communicative functions. PENTAtrainer2 consists of three computational tools: Annotation tool for defining communicative functions as parallel layers, Learning tool for globally optimizing pitch target parameters, and Synthesis tool for generating speech melody according to the learned pitch targets. Being both theory-based and trainable, PENTAtrainer2 can serve as an effective tool for basic research in speech prosody.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PENTATrainer2: A hypothesis-driven prosody modeling tool

Prosody is an essential aspect of speech, as it carries both lexical and non-lexical information. A conventional approach to studying speech prosody is to collect and analyze F0 data based on certain hypotheses and then develop a theory based on the observation, which constitutes the final conclusion of the study. This process is however far from complete, as the developed theory has not been a...

متن کامل

Speech melody as articulatorily implemented communicative functions

The understanding of speech melody, i.e., pitch variations related to both tone and intonation, can be improved by simultaneously taking into consideration two basic facts: that the melody conveys communicative meanings, and that it is produced by human articulators. Communicative meanings, as I will argue, are conveyed through a set of separate functions which are realized by an articulatory s...

متن کامل

Modelling Japanese intonation using PENTAtrainer2

This paper presents results from Japanese intonation modelling using PENTAtrainer2, an articulatory synthesiser. Our first aim is to show that PENTA, on which PENTAtrainer2 is based, can achieve high accuracy in predictive synthesis of varying intonation contours. We trained the synthesiser on a 6251-sentence functionally annotated corpus and generated F0 contours for each communicative conditi...

متن کامل

Discovering Underlying Tonal Representations by Computational Modeling: a Case Study of Thai

In the present study we test a computational method for investigating underlying tonal representations. The representation explored is in the form of simple linear functions as ideal pitch targets, with which close-to-natural F0 contours can be computationally generated. The estimation of the pitch targets is done with PENTAtrainer2, a hypothesisdriven prosody-modeling tool that combines functi...

متن کامل

Modeling tone and intonation in Mandarin and English as a process of target approximation.

This paper reports the development of a quantitative target approximation (qTA) model for generating F(0) contours of speech. The qTA model simulates the production of tone and intonation as a process of syllable-synchronized sequential target approximation [Xu, Y. (2005). "Speech melody as articulatorily implemented communicative functions," Speech Commun. 46, 220-251]. It adopts a set of biom...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013